support rl vit lora with vLLM #147

Merged
tastelikefeet merged 3 commits into modelscope:main from hjh0119:vit-lora
Apr 9, 2026

@hjh0119 (Collaborator) commented Apr 9, 2026

No description provided.

@gemini-code-assist (bot, Contributor) left a comment

Code Review

This pull request refactors the LoRA configurations in grpo.py, grpo_mm.py, and short_math_grpo.py to better handle text-only and multimodal training scenarios, including enabling tower_connector_lora for multimodal setups. It also makes a minor logical reordering of the model_keys check in megatron.py. The review feedback suggests making an inline comment in grpo_mm.py more concise and PEP 8 compliant.

@tastelikefeet tastelikefeet merged commit 9b4d0f0 into modelscope:main Apr 9, 2026
1 of 3 checks passed

2 participants